Kernel mean embeddings are a powerful tool to represent probability distributions over arbitrary spaces as single points in a Hilbert space. Yet, the cost of computing and storing such embeddings prohibits their direct use in large-scale settings. We propose an efficient approximation procedure based on the Nystr\"om method, which exploits a small random subset of the dataset. Our main result is an upper bound on the approximation error of this procedure. It yields sufficient conditions on the subsample size to obtain the standard $n^{-1/2}$ rate while reducing computational costs. We discuss applications of this result for the approximation of the maximum mean discrepancy and quadrature rules, and illustrate our theoretical findings with n...
Many interesting machine learning problems are best posed by considering instances that are distribu...
Embeddings of probability measures into reproducing kernel Hilbert spaces have been proposed as a st...
Abstract Kernel methods represent one of the most powerful tools in machine learning to tackle probl...
Kernel mean embeddings are a popular tool that consists in representing probability measures by thei...
Maximum mean discrepancy (MMD) is a kernelbased distance measure between probability distributions. ...
This tutorial will give an introduction to the recent understanding and methodology of the kernel me...
We provide a theoretical foundation for non-parametrically estimating functions of random variables ...
Embeddings of probability measures into reproducing kernel Hilbert spaces have been proposed as a st...
We provide a theoretical foundation for non-parametrically estimating functions of random variables ...
We analyze the Nyström approximation of a positive definite kernel associated with a probability mea...
While kernel methods are the basis of many popular techniques in supervised learning, they are less ...
While kernel methods are the basis of many popular techniques in supervised learning, they are less ...
We examine the problem of approximating the mean of a set of vectors as a sparse linear combination ...
We investigate the efficiency of k-means in terms of both statistical and computational requirements...
Provides a comprehensive review of kernel mean embeddings of distributions and, in the course of doi...
Many interesting machine learning problems are best posed by considering instances that are distribu...
Embeddings of probability measures into reproducing kernel Hilbert spaces have been proposed as a st...
Abstract Kernel methods represent one of the most powerful tools in machine learning to tackle probl...
Kernel mean embeddings are a popular tool that consists in representing probability measures by thei...
Maximum mean discrepancy (MMD) is a kernelbased distance measure between probability distributions. ...
This tutorial will give an introduction to the recent understanding and methodology of the kernel me...
We provide a theoretical foundation for non-parametrically estimating functions of random variables ...
Embeddings of probability measures into reproducing kernel Hilbert spaces have been proposed as a st...
We provide a theoretical foundation for non-parametrically estimating functions of random variables ...
We analyze the Nyström approximation of a positive definite kernel associated with a probability mea...
While kernel methods are the basis of many popular techniques in supervised learning, they are less ...
While kernel methods are the basis of many popular techniques in supervised learning, they are less ...
We examine the problem of approximating the mean of a set of vectors as a sparse linear combination ...
We investigate the efficiency of k-means in terms of both statistical and computational requirements...
Provides a comprehensive review of kernel mean embeddings of distributions and, in the course of doi...
Many interesting machine learning problems are best posed by considering instances that are distribu...
Embeddings of probability measures into reproducing kernel Hilbert spaces have been proposed as a st...
Abstract Kernel methods represent one of the most powerful tools in machine learning to tackle probl...